Optimum estimator in simple random sampling using two auxiliary attributes with application in agriculture, fisheries and education sectors

In the modern age of information technology, data are available everywhere in huge amounts, and every sector generates a large volume of data every day. Investigating every unit of data is not feasible because of limited resources such as time, labor, and cost. In such situations, survey sampling is recommended for drawing information about population parameters. The main objective of the present study is therefore to develop an estimation method for obtaining information about a population parameter. We propose an optimum estimator for enhanced estimation of the population mean in simple random sampling that utilizes information on two auxiliary attributes. Expressions for the bias, mean squared error (MSE) and minimum mean squared error of the proposed estimator are derived up to the first order of approximation, and it is shown theoretically that, under the derived conditions, the proposed estimator performs better than the existing estimators. Four populations are used to assess the performance and applicability of the proposed estimator. The percentage relative efficiency (PRE) of the proposed estimator for the four populations is 209.533, 163.852, 210.398 and 340.578, respectively. The numerical illustrations confirm that the proposed estimator dominates the existing estimators.
• The main objective of the present study is to propose a new estimator for estimating the population mean using two auxiliary attributes under simple random sampling.
• The bias and mean square error of the proposed estimator are derived and compared theoretically with those of the existing estimators to assess efficiency.
• Applications of the proposed estimator are highlighted using real data sets from various sectors.


Specifications table
Subject Area: Mathematical Statistics
More specific subject area: Sample Surveys
Method name: Optimum estimator for estimating population mean using two auxiliary attributes
Name and references of original method: The proposed method is motivated by following references:
Resource availability: Data utilized in the analysis is available in public domain.

Background
In sample surveys, it is well documented in Cochran [20] that supplementary information provided by auxiliary variables or attributes is frequently used to increase the precision of estimators by taking advantage of the correlation between the study variable and the auxiliary variable. Regression, ratio, and product estimators are good examples. Cochran [1] showed that the ratio estimator is best suited when the study and auxiliary variables are highly positively correlated, whereas the product method of estimation is better when they are highly negatively correlated. In many real-life situations, the study variable is not quantitative in nature. When the responses recorded from respondents are qualitative, the recorded information is called an attribute. Several studies, such as Shabbir and Gupta [22] and Abd-Elfattah et al. [23], have sought to improve the precision of estimators by utilizing auxiliary attributes, with applications in agriculture, health science, fisheries, power engineering, etc. It has also been found that using more than one auxiliary attribute enhances the efficiency of the estimator. Singh et al. [2] used auxiliary attribute information to establish ratio estimators in simple random sampling (SRS). The bias and mean square error of the estimator were computed for an existing data set available in the literature. The estimator was developed as a modification of the Koyuncu and Kadilar [21] estimator and was shown to outperform the existing estimators. Singh and Kumar [4] used auxiliary information to develop an improved regression estimator for SRS. There the auxiliary information is qualitative, and the concept of non-response is incorporated in the estimation. Malik and Singh [3] initiated the use of two auxiliary variables in estimating the population mean and proposed enhanced estimators.
Here, the auxiliary information is available in qualitative form, and the estimator performed better than the simple regression estimator. Ekpenyong and Enang [5] suggested improved exponential estimators in SRS for estimating the population mean. Simple random sampling without replacement was used in developing the estimator. Lu [6] explored applications of estimators developed under auxiliary information in the agriculture and power engineering sectors. The estimator was compared with the regression estimator and other existing estimators and proved efficient. Zaman and Kadilar [7] utilized auxiliary information to develop a novel family of exponential estimators. Ahmad et al. [8] generalized exponential ratio estimators under auxiliary information. Their estimator is exponential-based, whereas the estimator proposed in the present study is of mixed type, a mixture of simple, ratio and product estimators. Mahajan et al. [9] explored applications of estimation and sample surveys in agriculture and health sciences. Kumar and Saini [10] suggested a predictive approach to estimating the population mean under an auxiliary attribute. Yunusa et al. [11] utilized auxiliary variables to develop regression-type estimators of the population mean. Rather et al. [24] used auxiliary information to develop a mixed exponential ratio-type estimator of the population mean. Simple random sampling and double sampling techniques were used to select the sample. Zaman et al. [25] proposed an exponential-type estimator for assessing and estimating the COVID-19 risk in various countries. Two multivariate families of exponential-type estimators were proposed by utilizing information on two auxiliary variables. Many authors like Wayangkau et al. [18], Waheeb et al. [15], Rajak [16], and Jabal et al.
[17] discussed other data analysis techniques for agricultural information. The literature cited above motivates us to explore the applicability of two auxiliary attributes in estimating the mean of various populations associated with the agriculture, fisheries, and education sectors. The primary goal of this paper is to propose a novel optimum estimator of the finite population mean using auxiliary attributes. Expressions for the bias and mean square error (MSE) of the proposed estimator are derived up to the first order of approximation. On the basis of theoretical and numerical comparisons, we demonstrate that the proposed estimator is more efficient than the existing estimators.

Material and methods
Consider a finite population $\gamma = (\gamma_1, \gamma_2, \gamma_3, \ldots, \gamma_N)$ of size $N$. We draw a sample of $n$ units (with $n < N$) from $\gamma$ using simple random sampling without replacement (SRSWOR).
Let $S_y^2$ be the population variance of the study variable $y$. Let $S_{d_1}^2$ and $S_{d_2}^2$ respectively be the population variances of the auxiliary attributes $d_1$ and $d_2$.
Let $C_y = S_y / \bar{Y}$ be the coefficient of variation of the study variable $y$.
Let $C_{d_1}$ and $C_{d_2}$ be the coefficients of variation of the auxiliary attributes $d_1$ and $d_2$.
Let $S_{yd_1}$ and $S_{yd_2}$ be the population covariances between the study variable $y$ and the auxiliary attributes $d_1$ and $d_2$, and let $S_{d_1 d_2}$ be the population covariance between the auxiliary attributes $d_1$ and $d_2$. Let $\rho_{yd_j}$ ($j = 1, 2$) be the population point bi-serial correlation coefficient between the study variable $y$ and the auxiliary attribute $d_j$.
The stepwise framework of the proposed method is as follows:
Step 1: Consider a finite population of size N.
Step 2: Select a random sample of size n from the population using simple random sampling without replacement.
Step 3: Observe $y_i$ and $\delta_i$ on the sampled units.
Step 4: Define the expressions for population and sample characteristics.
Step 5: Propose the estimator for estimating population mean using two auxiliary attributes and derive its properties.
Step 6: Compare the proposed estimator with the existing estimators theoretically and numerically.
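
Steps 1 to 4 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' code: the toy population, the function names `draw_srswor` and `summarise`, the `(y, d1, d2)` tuple layout and the (size − 1) divisor for the variance are all chosen for the example.

```python
import random
import statistics


def draw_srswor(population, n, seed=None):
    """Step 2: draw n units by simple random sampling without replacement."""
    if not 0 < n <= len(population):
        raise ValueError("sample size must satisfy 0 < n <= N")
    rng = random.Random(seed)
    return rng.sample(population, n)  # sample() never repeats a unit


def summarise(units):
    """Step 4: mean, variance and CV of y, plus the proportions of d1 and d2.

    Each unit is a (y, d1, d2) tuple with d1 and d2 coded 0/1 (assumed layout).
    The variance uses the (size - 1) divisor, which is an assumption here.
    """
    ys = [y for y, _, _ in units]
    mean_y = statistics.fmean(ys)
    var_y = statistics.variance(ys)
    return {
        "mean_y": mean_y,
        "var_y": var_y,
        "cv_y": var_y ** 0.5 / mean_y,
        "prop_d1": statistics.fmean([d1 for _, d1, _ in units]),
        "prop_d2": statistics.fmean([d2 for _, _, d2 in units]),
    }


# Step 1: a hypothetical population of N = 8 units (made-up values).
population = [
    (12.0, 1, 0), (15.0, 1, 1), (9.0, 0, 0), (20.0, 1, 1),
    (7.0, 0, 1), (11.0, 0, 0), (18.0, 1, 1), (10.0, 0, 0),
]

sample = draw_srswor(population, n=4, seed=1)  # Step 2
sample_stats = summarise(sample)               # Steps 3-4, sample side
population_stats = summarise(population)       # Step 4, population side
```

Applying `summarise` to both the sample and the full population keeps the sample and population characteristics in the same form, so the sample proportions of the attributes can be compared directly with the known population proportions.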

Existing estimators
The most widely used estimator of the population mean $\bar{Y}$ of the study variable, discussed by Cochran [1], is the sample mean
$\bar{y} = \frac{1}{n} \sum_{i=1}^{n} y_i$
The sample mean is an unbiased estimator of the population mean, and up to the first order of approximation its variance (or MSE) is given by
$\mathrm{Var}(\bar{y}) = \left( \frac{1}{n} - \frac{1}{N} \right) S_y^2$ (2)

Naik and Gupta ($\hat{\mu}_1$)
Naik and Gupta [12] proposed a ratio estimator, $\hat{\mu}_1$, of the population mean $\bar{Y}$ for the case where the population proportion $D_1$ of the auxiliary attribute is known. The bias and MSE of $\hat{\mu}_1$ are given to the first order of approximation.

Naik and Gupta ($\hat{\mu}_2$)
Naik and Gupta [12] also proposed a product estimator, $\hat{\mu}_2$, of the population mean $\bar{Y}$ for the case where the population proportion $D_1$ of the auxiliary attribute is known. The bias and MSE of $\hat{\mu}_2$ are given to the first order of approximation.

Singh et al. [2] suggested an exponential-type ratio estimator, $\hat{\mu}_3$, for estimating the population mean $\bar{Y}$ when the population proportion $D_1$ of the auxiliary attribute is known. The bias and MSE of $\hat{\mu}_3$ are given to the first order of approximation.

Singh et al. [2] also suggested a product estimator, $\hat{\mu}_4$, for estimating the population mean $\bar{Y}$ when the population proportion $D_1$ of the auxiliary attribute is known. The bias and MSE of $\hat{\mu}_4$ are given similarly.

Kumar and Bhougal ($\hat{\mu}_5$)
Kumar and Bhougal [13] proposed an exponential-type ratio-product estimator, $\hat{\mu}_5$, for estimating the population mean $\bar{Y}$ when the population proportion $D_1$ of the auxiliary attribute is known; the estimator involves an unknown constant $\alpha$.
The bias and MSE of $\hat{\mu}_5$ are given to the first order of approximation, together with the optimum value of $\alpha$.

Singh and Kumar ($\hat{\mu}_6$)
Singh and Kumar [4] suggested a ratio estimator, $\hat{\mu}_6$, for estimating the population mean $\bar{Y}$ when the population proportions $D_1$ and $D_2$ of the auxiliary attributes are known. The bias and MSE of $\hat{\mu}_6$ are given.

Singh and Kumar ($\hat{\mu}_7$)
Singh and Kumar [4] suggested a product estimator, $\hat{\mu}_7$, for estimating the population mean $\bar{Y}$ when the population proportions $D_1$ and $D_2$ of the auxiliary attributes are known. The bias and MSE of $\hat{\mu}_7$ are given to the first order of approximation.

Ahmad et al. [8] proposed a generalized class of factor-type estimators, $\hat{\mu}_8$, for estimating the population mean $\bar{Y}$ when the population proportions $D_1$ and $D_2$ of the auxiliary attributes are known. The bias and MSE of $\hat{\mu}_8$ are given to the first order of approximation, followed by the minimum MSE of $\hat{\mu}_8$ and the optimum values $\sigma_{1\,\mathrm{opt}}$ and $\sigma_{2\,\mathrm{opt}}$.
Neglecting terms in the $\pi$'s of power greater than two in Eq. (21), we obtain Eq. (22). Taking expectations on both sides of Eq. (22) gives the bias of the proposed estimator up to the first order of approximation (Fig. 1).
To derive the MSE of the proposed estimator, we square both sides of Eq. (22), neglect terms in the $\pi$'s of power greater than two, and take expectations on both sides; simplification yields Eq. (23). To obtain the optimum values of $\omega_1$, $\omega_2$ and $\omega_3$, we partially differentiate Eq. (23) with respect to $\omega_1$, $\omega_2$ and $\omega_3$ and set the derivatives equal to zero. Substituting the resulting optimum values into Eq. (23) gives the minimum MSE of $\hat{\mu}_{prop}$.
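
The optimisation step can be illustrated generically. As a sketch only, assume that the first-order MSE is quadratic in the weights, written with an illustrative constant $c$, vector $\mathbf{b}$ and symmetric positive-definite matrix $\mathbf{A}$. These symbols are hypothetical stand-ins introduced here, not the coefficients of the proposed estimator. Setting the partial derivatives to zero then gives a linear system for the optimum weights:

```latex
\begin{align*}
\mathrm{MSE}(\boldsymbol{\omega})
  &= c - 2\,\mathbf{b}^{\top}\boldsymbol{\omega}
       + \boldsymbol{\omega}^{\top}\mathbf{A}\,\boldsymbol{\omega},
  \qquad \boldsymbol{\omega} = (\omega_1, \omega_2, \omega_3)^{\top}, \\
\frac{\partial\,\mathrm{MSE}}{\partial \boldsymbol{\omega}}
  &= -2\,\mathbf{b} + 2\,\mathbf{A}\,\boldsymbol{\omega} = \mathbf{0}
  \;\Longrightarrow\;
  \boldsymbol{\omega}_{\mathrm{opt}} = \mathbf{A}^{-1}\mathbf{b}, \\
\mathrm{MSE}_{\min}
  &= c - 2\,\mathbf{b}^{\top}\mathbf{A}^{-1}\mathbf{b}
       + \mathbf{b}^{\top}\mathbf{A}^{-1}\mathbf{A}\,\mathbf{A}^{-1}\mathbf{b}
   = c - \mathbf{b}^{\top}\mathbf{A}^{-1}\mathbf{b}.
\end{align*}
```

Under these assumptions the minimum MSE never exceeds $c$, the value at $\boldsymbol{\omega} = \mathbf{0}$, because $\mathbf{b}^{\top}\mathbf{A}^{-1}\mathbf{b} \ge 0$ when $\mathbf{A}$ is positive definite.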

Results and discussion
To examine the dominance and applicability of the proposed estimator in simple random sampling, we considered four real data sets from the agriculture, fisheries and education sectors, available in Ahmad et al. [8]. A similar population is given for reference in Appendix A.

Population 1. (Source: education sector)
Let y be the number of instructors, let d 1 be the attribute indicating that the total number of primary and secondary school students in Turkey in 2007 was larger than 11,440.5, and let d 2 be the attribute indicating that the total number of primary and secondary school students in Turkey in 2008 was greater than 333.1647. The population information about the data set is given as:

We compute the MSE values of the existing estimators and the proposed estimator using Eqs. (2), (4), (6), (8), (10), (12), (14), (16), (18) and (24); these values are shown in Table 1.
To measure the Percentage Relative Efficiency (PRE), we apply the following formula:
The numerical comparison of PRE for the existing and proposed estimators is shown in Table 2.
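
The PRE comparison can be sketched as follows. The sketch assumes the common convention that PRE is 100 times the MSE of a reference estimator (here the sample mean) divided by the MSE of the estimator being assessed. The function name and the MSE values are hypothetical and are not taken from Table 1.

```python
def percentage_relative_efficiency(mse_reference, mse_estimator):
    """PRE of an estimator relative to a reference estimator, in percent.

    Assumed convention: PRE = 100 * MSE(reference) / MSE(estimator),
    so values above 100 mean the estimator beats the reference.
    """
    if mse_reference <= 0 or mse_estimator <= 0:
        raise ValueError("MSE values must be positive")
    return 100.0 * mse_reference / mse_estimator


# Hypothetical MSE values, for illustration only.
mse_values = {"sample_mean": 4.0, "ratio": 3.2, "proposed": 2.0}

pre_table = {
    name: percentage_relative_efficiency(mse_values["sample_mean"], mse)
    for name, mse in mse_values.items()
}

# The estimator with the largest PRE is the one with the smallest MSE.
most_efficient = max(pre_table, key=pre_table.get)
```

Because the reference MSE is fixed, ranking estimators by largest PRE is equivalent to ranking them by smallest MSE, which is why the two criteria are reported side by side.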
The proposed estimator is compared with the eight estimators presented in Section 2, and the results are shown numerically and graphically. It is observed from Figs. 1 and 2 that the proposed estimator is more efficient for estimating the population mean than the existing estimators. In the theory of sample surveys, the major objective of any statistician or investigator is to minimize the MSE and maximize the PRE of the estimation in order to draw sound inferences about the study population. The proposed estimator is validated by its minimum MSE and maximum PRE. It attains the minimum MSE and

Conclusion
In this article, we proposed an optimum estimator of the population mean in simple random sampling that utilizes information on two auxiliary attributes. Mathematical expressions for the bias, mean squared error (MSE) and minimum mean squared error of the proposed estimator are derived. The mean square error of the proposed estimator is shown in Section 3. The proposed estimator is efficient under the derived conditions discussed in Section 4. Implementing the proposed estimator helps in obtaining a precise estimate of the population mean, on the basis of which effective decisions can be made. It is recommended that the proposed estimator be used in sample surveys in areas such as education, agriculture, fisheries and health sciences. Further, the proposed estimator can be extended to use multi-auxiliary information under various sampling designs.

Declaration of Competing Interest
There is no conflict of interest among the authors.

Data availability
Data is available in public domain.